Weighted parallel model combination for noisy speech recognition

نویسندگان

Tai-Hwei Hwang

Hsiao-Chuan Wang

چکیده

This paper proposes a modified parameter mapping scheme for parallel model combination (PMC) method. The modification aims to improve the discriminative capabilities of the compensated models. It is achieved by the rearrangement of the distributions of state models in order to emphasize the contribution of the mean in the following process. Both distributions of speech model and noise model are shaped in cepstral domain through a covariance contracting procedure. After the compensation steps, an expanding procedure of the adapted covariance is necessary to release the emphasis. Using this process, the discriminative capability is increased so that the recognition accuracy is improved. In this paper, the recognition of Chinese names demonstrates the improvement to the original PMC method, especially when SNR is low.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...

متن کامل

Improved parallel model combination based on better domain transformation for speech recognition under noisy environments

The parallel model combination (PMC) technique has been shown to achieve very good performance for speech recognition under noisy conditions. However, there still exist some problems based on the PMC formula. In this paper, we first investigated these problems and some modifications on the transformation process of PMC were proposed. Experimental results show that this modified PMC can provide ...

متن کامل

Improved robustness for speech recognition under noisy conditions using correlated parallel model combination

The parallel model combination (PMC) technique has been shown to achieve very good performance for speech recognition under noisy conditions. In this approach, the speech signal and the noise are assumed uncorrelated during modeling. In this paper, a new correlated PMC is proposed by properly estimating and modeling the nonzero correlation between the speech signal and the noise. Preliminary ex...

متن کامل

Speech recognition in noisy environment using weighted projection-based likelihood measure

This paper investigates a projection-based likelihood meaure that improves speech recognition performance in noisy environment. The projection-based likelihood measure is modi ed to give the weighting and projection e ect and to reduce computational complexity. It is evaluated in sub-model based word recognition using semi-continuous hidden Markov model with speaker independent mode. Experiment...

متن کامل

An Information-Theoretic Discussion of Convolutional Bottleneck Features for Robust Speech Recognition

Convolutional Neural Networks (CNNs) have been shown their performance in speech recognition systems for extracting features, and also acoustic modeling. In addition, CNNs have been used for robust speech recognition and competitive results have been reported. Convolutive Bottleneck Network (CBN) is a kind of CNNs which has a bottleneck layer among its fully connected layers. The bottleneck fea...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1998

Weighted parallel model combination for noisy speech recognition

نویسندگان

چکیده

منابع مشابه

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

Improved parallel model combination based on better domain transformation for speech recognition under noisy environments

Improved robustness for speech recognition under noisy conditions using correlated parallel model combination

Speech recognition in noisy environment using weighted projection-based likelihood measure

An Information-Theoretic Discussion of Convolutional Bottleneck Features for Robust Speech Recognition

عنوان ژورنال:

اشتراک گذاری